Weasels, Hedges and Peacocks: Discourse-level Uncertainty in Wikipedia Articles
نویسنده
چکیده
Uncertainty is an important linguistic phenomenon that is relevant in many areas of language processing. While earlier research mostly concentrated on the semantic aspects of uncertainty, here we focus on discourseand pragmaticsrelated aspects of uncertainty. We present a classification of such linguistic phenomena and introduce a corpus of Wikipedia articles in which the presented types of discourse-level uncertainty – weasel, hedge and peacock – have been manually annotated. We also discuss some experimental results on discourse-level uncer-
منابع مشابه
Finding Hedges by Chasing Weasels: Hedge Detection Using Wikipedia Tags and Shallow Linguistic Features
We investigate the automatic detection of sentences containing linguistic hedges using corpus statistics and syntactic patterns. We take Wikipedia as an already annotated corpus using its tagged weasel words which mark sentences and phrases as non-factual. We evaluate the quality of Wikipedia as training data for hedge detection, as well as shallow linguistic features.
متن کاملAnalyzing Iran Daily and US Today in Terms of Meta-Discourse Elements
The role of using meta-discourse elements in writing, especially in research newspapers, is so important that their authors can convey certainty, doubt, and characteristics of the writers in their writings. There are different meta-discourse markers used by various authors in different branches; for example, hedges and boosters are the most important devices in writing. The meta-discourse eleme...
متن کاملThe Use of Hedging in Discussion Sections of Applied Linguistics Research Articles with Varied Research Methods
The discourse of the discussion in research articles is regarded to be of considerable significance—as in this section the findings are interpreted in light of previous research and the authors’ argumentations are put forward as a major contribution (see Hyland, 1999). For this reason, the content and structure of the discussion section have been explored in several studies; however, little att...
متن کاملHedges and Boosters in Academic Writing: Native vs. Non-Native Research Articles in Applied Linguistics and Engineering
The expression of doubt and certainty is crucial in academic writing where the authors have to distinguish opinion from fact and evaluate their assertions in acceptable and persuasive ways. Hedges and boosters are two strategies used for this purpose. Despite their importance in academic writing, we know little about how they are used in different disciplines and genres and how foreign language...
متن کاملAdvertising Keyword Suggestion Using Relevance-Based Language Models from Wikipedia Rich Articles
When emerging technologies such as Search Engine Marketing (SEM) face tasks that require human level intelligence, it is inevitable to use the knowledge repositories to endow the machine with the breadth of knowledge available to humans. Keyword suggestion for search engine advertising is an important problem for sponsored search and SEM that requires a goldmine repository of knowledge. A recen...
متن کامل